EDGE DISTRIBUTION AND DENSITY 
IN THE CHARACTERISTIC SEQUENCE 
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Abstract. The characteristic sequence of hypergraphs {Pn : n < u>) associated to a for- 
mula ip{x\y), introduced in [3, is defined by Pn{yi,---yn) = (3a;) Ai<„ '/'(a;; This 
paper continues the study of characteristic sequences, showing that graph-theoretic tech- 
niques, notably Szemeredi's celebrated regularity lemma, can be naturally applied to the 
study of model-theoretic complexity via the characteristic sequence. Specifically, we relate 
classification-theoretic properties of ip and of the P„ (considered as formulas) to density 
between components in Szemeredi-regular decompositions of graphs in the characteristic 
sequence. In addition, we use Szemeredi regularity to calibrate model-theoretic notions of 
independence by describing the depth of independence of a constellation of sets and showing 
that certain failures of depth imply Shelah's strong order property SOP3; this sheds light 
on the interplay of independence and order in unstable theories. 



1. Introduction 

The characteristic sequence {Pn : n < tu) is a tool for studying the combinatorial complex- 
ity of a given formula (p, Definition 12.21 below. It follows from [5], [6] that the Keisler order 
[3] locahzes to the study of v^-types and specifically of characteristic sequences. However, 
this article will not focus on ultrapowers. 

The analysis of [H] established that characteristic sequences are essentially trivial when 
the ambient theory T is NIP, Theorem 12.81 below. In this article, we turn to the study 
of characteristic sequences in the presence of the independence property. The framework 
of characteristic sequences allows us to bring a deep collection of graph-theoretic structure 
theorems to bear on our investigations. Notably, the classic model-theoretic move of polar- 
izing complex structure into rigid and random components (e.g. Shelah's isolation of the 
independence property and the strict order property in unstable theories) is accomplished 
here by the application of Szemeredi's Regularity Lemma, §1] Theorem [B] below. Because 
the Regularity Lemma describes a possible decomposition of any sufficiently large graph, it 
can be applied here to understand how arbitrarily large subsets of Pi generically interrelate. 

In Sections IMSl we investigate how classic properties of T affect the density 6 attained 
between arbitrarily large e-regular subsets A,B C Pi (after localization) in the sense of 
Szemeredi regularity, where the edge relation is given by P2. The picture we obtain is as 
follows. When ip is stable, by Theorem 12. 8[ the density (after localization) is always 1. 
When if is simple unstable, after localization, there will be an infinite number of missing 
edges but we can say something strong about their distribution: (*) the density between 
arbitrarily large e-regular pairs must tend towards or 1 as the graphs grow (indeed, here 
simplicity is sufficient but not necessary). In the simple unstable finer function 

counting the number of edges omitted over finite subgraphs of size n is meaningful, and 
we give a preliminary description of its possible values in Proposition 13.91 In Section O we 
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use model theory to relate the property (*) of having arbitrarily large e-regular subsets of 
Pi with edge density bounded away from and 1 to the phenomenon of instability in the 
characteristic sequence, which is strictly more complex than failure of simplicity. In Section 
E] we refine this phenomenon by defining and investigating the compatible and empty order 
properties. On the level of theories, the compatible order property characterizes the model- 
theoretic rigidity property SOP3, which is known to imply maximality in the Keisler order 

by 0. 

In the other direction, in Section [7] we use Szemeredi regularity to bring to light a subtle 
model-theoretic failure of randomness, by considering the "depth of independence" of a 
constellation of infinite sets. In the language of Definition \7.2\ we show that theories which 
are /"^^ but not I^^l for some n > 2, are SOP3. This is a result about the fine structure 
of the classic SOP/IP distinction, illustrating the tradeoff between a weaker notion of strict 
order {SOP3) and a stronger notion of independence (/^+^) in unstable theories. 

Acknowledgements. Thanks are due to my advisor Thomas Scanlon, and to Leo Harring- 
ton, for many stimulating conversations, as well as to Scanlon's NSF grant for funding a trip 
to the ICM in Madrid where I first learned of Szemeredi's work. Thanks also to Laci Babai 
for a copy of the helpful survey [4J- 

2. Preliminaries 
The following conventions will be in place throughout the article. 

Convention 2.1. (Conventions) 

(1) // a variable or a tuple is written x or a rather than x,a, this does not necessarily 
imply that i{x),i{a) = 1. 

(2) Unless otherwise stated, T is a complete theory in the language C. 

(3) A set is /c-consistent if every k- element subset is consistent, and it is fc-inconsistent 
if every k- element subset is inconsistent. 

(4) (pe{x; yi,...ye) := f\i<i V^ix; Vi) 

(5) 5'ho(co') is the set of all finite subsets ofoj. 

(6) e, 5 are real numbers, with < e < 1 and < 5 < 1 . 

(7) Let G be a symmetric binary graph. We present graphs model-theoretically, i.e. as 
sets of vertices on which certain edge relations hold. Throughout this article R{x, y) is 
a binary edge relation, which will sometimes (we will clearly say when) be interpreted 
as P2. 

(8) A graph is a simple graph: no loops and no multiple edges. Definition \2.2\ below 
implies that'ix{Pi{x) —>■ P2{x,x)), but we will, by convention, not count loops when 
taking P2 as R. 

(9) Given a graph G, with symmetric binary edge relation R{x,y): 

• \G\ is the size ofG, i.e. the number of vertices. 

• e{G) is the number of edges of G. 

• e{G) is the number of edges omitted in G. 

• An empty graph is a graph with no edges. 

• A complete graph is a graph with all edges, i.e. in which x,y & G,x ^ y =^ 
R{x,y). 

• The degree of a vertex is the number of edges which contain it. 
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• The dual graph G' has the same vertices and inverted edges, i.e. for x ^ y, 

(10) Write {X,Y) to indicate a a bipartite graph. Then: 

• e{X, Y) is the number of edges between elements x G X and y eY . Note that if 
G = AU B then possibly e{G) ^ e{A,B), as the latter counts only edges between 
A and B. 

• e(X, Y) is the number of edges omitted between elements x G X and y eY . 

• The density of a finite bipartite graph (X, Y) is (5(X, Y) := e(X, F)/|X||F| when 
|X|, |F| 7^ 0, and otherwise. 

• An empty pair is a pair of vertices x, y with -iR{x, y). 

• An infinite empty pair is {X,Y) such that \X\ = \Y\ > Kq and for all x E X, 
y eY, we have -^R{x,y). 

• A complete bipartite graph is (X, Y) such that for all x E X,y E Y , R{x, y). 

• The dual (X, Y)' of a bipartite graph inverts precisely the edges between the 
components X and Y . 

We will make extensive use of the important classification-theoretic dividing lines of sta- 
bility, simplicity, the independence property, and the strict order property; see, for instance. 
Chapter II, sections 2-4 and f8j. A theory or a formula is NIP, also called dependent, if 
it does not have the independence property; see, for instance, [TT] . 

We now turn to definitions. The characteristic sequence of hypergraphs was introduced in 
[6j as a tool for studying the complexity of a given formula y?. Let us set the stage by briefly 
reviewing some of the results obtained there. 

Definition 2.2. (Characteristic sequences) Let T be a first-order theory and ip a formula of 
the language ofT. 

• Forn< u, P„(2i,...z„) := 3x /\.^^ip{x; Zi) . 

• The characteristic sequence of (p in T is {Pn :n < u). 

• Write (T, (p) i— (P„) for this association. 

• We assume that T h \/y3z^x{ip{x; z) -^(p{x;y)). If this does not already hold for 
some given ip, replace (p with 9{x; y, z) = ip{x; y) A ^^pi^x] z). 

Convention 2.3. As the characteristic sequence is definable in T, its first-order properties 
depend only on the theory and not on the model of T chosen. Throughout this paper, we 
will be interested in whether certain, possibly infinite, configurations appear as subgraphs 
of the Pn- By this we will always mean whether or not it is consistent with T that such 
a configuration exists when P„ is interpreted in some sufficiently saturated model. Thus, 
without loss of generality the formulas Pn will often be identified with their interpretations 
in some monster model. 

Characteristic sequences give a natural context for studying the complexity of y^-types, 
which correspond in this case to complete graphs. 

Definition 2.4. Fix T,ip, M \=T and {T,ip) ^ (P„). 

(1) A positive base set is a set A G Pi such that A" C P„ for all n < uj. 

(2) The sequence (Pn) has support k if: Pn{yi, ■ ■ .yn) iff Pk holds on every k-element 
subset of {yi, . . .yn}- The sequence has finite support if it has support k for some 
k < UJ. 
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(3) The elements Oi, . . . G Pi are a k-point extension of the Poo-complete graph A just 
in case Aai, . . .Ok is also a Poo-complete graph. 

Observation 2.5. Fix T, ip and M \=T and suppose (T, ip) {Pn) ■ 

(1) The following are equivalent, for a set A C M: 

(a) A is a positive base set. 

(b) The set {(p{x; a) : a E A} is consistent. 

(2) The following are equivalent, for a set A C Pi: 

(a) A" n P„ = for some n. 

(b) {ip{x;a) : a E A} is 1-consistent but n-inconsistent ( Convention \2. 1^ 2)). 
Note that if A is infinite, compactness then implies some instance of p divides. 

(3) The following are equivalent: 

(a) (P„) has finite support. 

(b) if does not have the finite cover property. 

Localization is a definable restriction of the predicates P„ of a certain useful form which 
eliminates some of the combinatorial noise around a positive base set A under analysis. 
Definability ensures that Convention 12.31 applies when asking whether certain configurations 
are present in some localization. 

Definition 2.6. (Localization, Definition 5.1 of [6j) Fix a characteristic sequence {T,ip) — > 
(Pn), cind choose B, A (Z M \= T with A a positive base set, possibly empty. A localization 
P^ of the predicate Pn{yi, ■ ■ - yn) around the positive base set A with parameters from B is 
given by a finite sequence of triples f : m ^ u x V^^^iyi, ■ ■ ■ yn) x 'Pno{B) where m < uj and: 

• writing f{i) = {ri,ai,l3i) and s for the elements of the set s, we have: 

Pi{yi,...yn) ■■= A Pni'^iJi) 

i<Tn 

• for each i < u, Ti implies that there exists a Pe-complete graph Ce such that Pi 
holds on all n-tuples from Ce. If this last condition does not hold, is a trivial 
localization. By localization we will always mean non-trivial localization. 

• In any model ofTi containing A and B, P/ holds on all n-tuples from A. 

For the purposes of this article, we will indicate where localization is useful without, 
generally, specifying the parameters or the form involved, writing simply "there exists a 
localization in which..." or "after localization..." for short. Because this may always be 
taken to include a fixed positive base set, the essential complexity of the type under analysis 
is not lost. Localization reveals a gap in the classification-theoretic complexity of (p and of 
P2. §3 below will shed light on this result: 

Conclusion 2.7. (Conclusion 5.10 of [6J) Suppose T is simple, (T, ip) \—>- (Pn) . Then for any 
n < UJ, and any partition of yi, . . .yn into object and parameter variables, after localization 
the formulas P2(l/i, y2), ■ ■ ■ Pnivi, ■ ■ ■ Un) do not have the order property. 

It turns out that when ip is NIP one can always localize (without losing sight of the 
positive base set A under analysis) so that any given finite initial segment of the characteristic 
sequence is a complete graph. In other words, the characteristic sequence is non-trivial in 
the presence of the independence property. 
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Theorem 2.8. (Theorem 6.17 of [6]) Let (f be a formula of T and (Pn) its characteristic 
sequence. 

(1) If ip is NIP, then for each positive base set A G Pi and for each n < uj, there exists 
a localization P{" D A of Pi which is a Pn-complete graph, i.e. {yi, . . .?/„} C 

PniVl, ■■■Vn)- 

(2) If (f has IP, then for all n < u. Pi contains a Pn-empty tuple. 

Furthermore, when cp is simple unstable we may assume that after localization, in any 
given finite initial segment of the characteristic sequence, there are uniform finite bounds on 
the size of empty graphs. 

Theorem 2.9. (Theorem 6.24 of [6]) Let (p be a formula of T and (Pn) its characteristic 



(1) If is simple, then for each P^o-graph A G Pi and for each n < uj, there exists a 
localization P{" D A of Pi in which there is a uniform finite bound on the size of 
a Pn-empty graph, i.e. there exists rrin such that X G Pi and X" fl P„ = implies 
\X\ < rrin. 

(2) // (p is not simple, then for all but finitely many r < u. Pi contains an infinite 
(r + 1) -empty graph. 

The stage is now set as follows. The characteristic sequence of hypergraphs are a sequence 
of incidence relations defined on the parameter space of a formula (p. Positive base sets 
correspond naturally (though not necessarily uniquely) to base sets for ip-tjpes. We turn 
to the study of the generic interrelationships between sets generally, and positive base sets 
particularly, in the parameter space of a given ip. Theorem 12.81 strongly focuses our attention 
on the "wild" case of theories with the independence property and Theorem 12.91 suggests 
simple unstable theories first object of study. 

3. Counting functions on simple Lp 

Throughout this section, we consider the binary edge relation P2 from the characteristic 
sequence of (p. The notation and vocabulary follow Convention 12. 1[ 

Observation 3.1. Suppose (p is stable. Then after localization, for any two disjoint finite 
X,Y G Pi, 6{X,Y) = 1. On the other hand, if ip is simple unstable then Pi contains an 
empty pair. 

Proof. Theorem 12.8( 1) says that when ip> is stable, after localization Pi is a complete graph, 
so a fortiori there are no edges omitted between disjoint components. The second clause is 



Definition 3.2. Define a : uj ^ uj to be 

max {e(X) : X C Pi, |X| = n} 

i.e. the largest number of P2- edges omitted over an n-size subset of Pi. 

Observation 3.3. Suppose ip is simple, i.e., ip does not have the tree property. Then after 
localization a{n) < "^""'''^ . 



sequence. 




□ 
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Proof. The maximum possible value "^"^""^^ of any a{n) is attained on a P2-enipty graph, on 
which X ^ y => -iP2{x,y). Apply Theorem 12.91 which says that when (p does not have 
the tree property then we have, after localization, a uniform finite bound k on the size of a 
P2-enipty graph X C Pi. So the function a is eventually strictly below the maximum. □ 

Corollary 3.4. The function a{n) is meaningful, i.e. after localization 

nin — 11 



> a{n) > 
precisely when ip is simple unstable. 

With some care we can easily restrict the range further. A famous theorem of Turan says: 
Theorem A. (Turan, ^:Theorem 2.2) If Gn is a graph with n vertices and 

1 \ n2 



^(^)>l^- k-lj 2 
then Gn contains a complete subgraph on k vertices. 

Definition 3.5. X = {aj : t < 2,i < uj) G Pi is an {uj, 2)-array if for all n < uj, 

Pniall, . . . al"J ^ (Vj, i < n) {i, =h ^ tj = t,) 

Claim 3.6. (Claim 4.5 of [6j) The following are equivalent, for a formula ip with character- 
istic sequence {Pn)-' 

(1) ip has the independence property. 

(2) {Pn) has an {u,2)-array. 

Observation 3.7. Suppose that (Pn) has an {uj^ 2) -array. Then a{n) > [|J . 
Corollary 3.8. When p is simple unstable, then after localization 



k-1 2 



i2 



> a n) > 



Proof. If p is simple unstable, p has the independence property and so Pi contains an 
{uj, 2)-array; so the righthand side is Observation 13.71 For the lefthand side, let > 1 be the 
uniform finite bound on the size of an empty graph from Theorem 12.91 and apply Turan's 
theorem to the dual graph. □ 

At the end of Section H] we will give a proof of the following: 

Proposition 3.9. When p is simple unstable either 



1 \ , , 



> a{n) > — or 0{n ) > a{n) > 



n 



l-kj 2 - ^ ' - A ^ ' ^ ' - 12 

The proof will follow from Theorem 14.81 below, which will show more, namely that for p 
simple unstable, either 0{n'^) > a{n) or there exists an infinite empty pair in Pi. 

Our strategy is going to be to show that in the absence of such an "empty pair" we can 
repeatedly partition sufficiently large graphs into many pieces of roughly equal size in such 
a way that, at each stage, the bulk of the omitted edges must occur inside the (eventually, 
much smaller) pieces. The main tool will be Theorem [B] below. 
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4. SZEMEREDI REGULARITY 



We begin with a review of Szemeredi's celebrated regularity lemma. Recall that e, 6 are 
real numbers, < e < 1 and < 5 < 1, following Convention 12.11 

Definition 4.1. [TU], [1] The finite bipartite graph {X,Y) is e-regular if for every X' C X, 
Y' CY with \X'\ > e\X\, \Y'\ > e\Y\, we have: \6{X,Y) - 6{X' ,Y')\ < e. 

The regularity lemma says that sufficiently large graphs can always be partitioned into a 
fixed finite number of pieces Xi of approximately equal size so that almost all of the pairs 
{Xi,Xj) are e-regular. 

Theorem B. (Szemeredi's Regularity Lemma [1], ^Q\) For every e, mo there exist N = 
A^(e, mo), m = m(e, mg) such that for any graph X, N < \X\ < Kq, for some mo < k < m 
there exists a partition X = Xi U ■ ■ ■ U Xk satisfying: 

• I \Xi\ — \Xj 1 1 < 1 for i,j<k 

• All but at most ek"^ of the pairs {Xi,Xj) are e-regular. 

One important consequence is that we may, approximately, describe large graphs G as 
random graphs where the edge probability between Xi and xj is the density dij between com- 
ponents Xi,Xj in some Szemeredi-regular decomposition. We include here two formulations 
of this idea from the literature, the first for intuition and the second for our applications. 

Theorem C. (from Gowers [2j) For every a > and every k there exists e > with 
the following property. Let Vi, . . .Vk be sets of vertices in a graph G, and suppose that 
for each pair {i,j) the pair {Vi,Vj) is e-regular with density 6ij. Let H be a graph with 
vertex set {xi,...Xk) and let Vi G Vi be chosen uniformly at random, the choices being 
independent. Then the probability that viVj is an edge of G iff edge of H differs 

from n^,:r^g/f5ijn^.,^^,^//(l - 6ij) by at most a. 

The formulation we will use. Theorem [D|, requires a preliminary definition. 

Definition 4.2. [4J (The reduced graph) 

(1) Let G = Xi, . . . Xfc be a partition of the vertex set of G into disjoint components. 
Given parameters e, 6, define the reduced graph R{G, e, S) to be the graph with vertices 
Xi {1 < i < k) and an edge between Xi,Xj just in case the pair {Xi,Xj) is e-regular 
of density > 6. 

(2) Write R{t) for a full graph of height t whose reduced graph is R, i.e., R{t) consists of 
k components Xi, . . . Xk, each with t vertices, such that e(Xj) = 0, and (5(Xj, Xj) = 1 
iff there is an edge between Xi and Xj in R. 

The following lemma (called the "Key Lemma" in |4j) says that sufficiently small subgraphs 
of the reduced graph must actually occur in the original graph G. 

Theorem D. (Key Lemma, [l]:Theorem 2.1) Given 6 > e > 0, a graph R, and a positive 
integer m, let G be any graph whose reduced graph is R, and let H be a subgraph of R{t) 
with h vertices and maximum degree A > 0. Set d = 6 — e and eo = d^/{2 + A). Then if 
e < cq and t — 1 < e^m, then H G G. Moreover the number of copies of H in G is at least 
{eomY. 
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Remark 4.3. In the statement of the Key Lemma, "H C G" means that there is a bijection 
f : H ^ X G G such that e{hi, implies e{f{hi), /(/i2)). With some slight modifications 
(recording whether a missing edge in the reduced graph means the density is near or the 
pair is not regular; and using the dual graphs when necessary) we may assume "H G G" has 
the usual meaning of isomorphic embedding, but this will not be an issue for the arguments 
in this section. 

We now work towards a proof of Proposition I3.9[ 

Convention 4.4. (Interstitial edges, b^^£, N^^i, E^^i) 

(1) Let G be a graph and let G = Xi U ■ ■ ■ U X„ be a decomposition into disjoint 
components, for instance as given by Theorem Call any edge between vertices 
X E Xi, z G Xj, i j an interstitial edge. 

(2) Let b^^i denote the upper bound on the necessary number of components, given by the 
regularity lemma as a function ofe,i. 

(3) Write (e, £)*-decomposition to denote any Szemeredi-regular decomposition into k 
components, for any i < k < fe^ ^. 

(4) Let N^/ denote the threshold size given by the regularity lemma as a function of e, i, 
such that any graph X with \X\ > N^^i admits an {e, (i)* -decomposition. 

(5) Let E^j G uj (E for "exactly") be the (possibly empty) set of cardinalities n for 
which \X\ = n implies X admits a Szemeredi-regular decomposition into exactly £ 
components. See the next Remark. 

Remark 4.5. On Definition l4.4( 2)-(4): the Regularity Lemma, along with the pigeonhole 
principle, implies that for cofinally many I, E^^e is infinite. Often, as Corollary l4.7( 3) suggests, 
for the purposes of our asymptotic argument it is sufficient to know that the number of 
components fluctuates in a certain fixed range, as given by the Regularity Lemma. 

An easy application of the Key Lemma shows that: 

Observation 4.6. Suppose that there exists 6 , < 6 < 1 such that for all < e < 1 and 

all N gN there exist disjoint subsets X^, G Pi, \Xn\ = \Yn\ > N such that {X^, Y^) is 
e-regular with density S. Then Pi contains an infinite empty pair. 

Proof. Apply the Key Lemma to each dual graph (X^r, Yj^)', which is still regular and whose 
density remains bounded away from and 1. For each t < uj, for all N sufficiently large, 
{X]\f, Yjy)' contains a complete bipartite graph on t vertices, as this occurs as a subgraph of 
R{t). □ 

Lemma 4.7. Suppose that Pi does not contain an infinite empty pair. 

(1) There is a function / : (0, 1) x u; — (0, 1) monotonic which approaches 1 as e ^ 
and N ^ oo such that if {X,Y) is an e-regular pair with \X\ = \Y\ = n then 
d{X,Y)>f{e,N). 

(2) There is a function g : ((0,1) x u) x u (0)1)? which is defined on all {{e,i),n) 
for which n > N^^i, and which is monotonically increasing and approaches 1 as (e, t) 
stays fixed and n oo, such that if \X\ = n then the density between any two regular 
components in an {e, i)* -decomposition of X is at least g{{e,i),n). 

(3) For every constant c> 0, and for all eg > 0, there exist < e < cq and for each such 
e, cofinally many i < u such that: for all n sufficiently large and all graphs X with 
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|X| = n, the number of interstitial edges in any {e^t}* -decomposition of X is strictly 
less than cv? . 

Proof. (1) This restates Observation 14.61 

(2) The regularity lemma provides a decomposition in which all components are approxi- 
mately the same size (±1), so the density of each e-regular pair will be at least /(e, ^). 

It remains to prove (3). For the moment, let e,£ be arbitrary and suppose that |X| > 
N^^g. Then |X| = n admits an e-regular decomposition into at /c-many pieces, each of size 
approximately m = |, where {]) i < k < i' := b^£. 

Writing 6 := g{{e,i), f), the contribution of the interstitial edges is at most: 

efcW + (l-e)(A;)2(l-(5)m2 

where the term on the left assumes the irregular pairs are empty, and the term on the right 
counts the expected number of interstitial edges missing from the regular pairs. By (f), this 
in turn is bounded by: 

< e(0W + (l-e)(£')'(l-5) 

Thus our claim will hold whenever e -|- (1 — e)(l — 5) < cjr. To obtain this, choose e > as 
small as desired and i as large as desired. Then 6 is monotonically increasing and approaches 
1 as a function of n by (2), as desired. □ 

We are now prepared to prove: 

Theorem 4.8. When ip is simple unstable, if there does not exist an infinite empty pair 
X,r C Pi, then a{n) < 0{n^). 

Proof. Given a positive real constant cq > 0, choose c, fc,t such that 0<c<l, 
and Co > 2c + ^. Fix a pair (e, such that i > k and {€,i) is one of the cofinally many 
pairs described in Lemma 14.7( 3) for the constant c. Now, for n sufficiently large and any 
\X\ = n, each of the components in an (e, £)*-decomposition of X will have size > N^^£ so 
will themselves admit an (e, £)*-decomposition with few interstitial edges. Repeating this 
argument to an arbitrary depth we can confirm that the bulk of the ostensibly missing edges 
must continually vanish inside the (eventually) relatively much smaller components at each 
successive decomposition. 

More precisely, let £' := b^^i and suppose n » {N^^i){i'y. By repeated application of 
Lemma 14.7( 3) to each successive decomposition, we obtain the following upper bound on 
a{n), where i < ki < £' for each 1 < i < t. The rightmost term assumes that after t — 1 
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levels of decomposition we obtain components which are themselves empty graphs. 



2 / \ 2 



a{n) <cn' + c[—j h + c [j^, ] (^2) + 



1 1 1 \ ^ n2 

<CnUl + — + 7— r + ■ ■ ■ + —r + 



o I ic 1 

<«M — + ^ 

< ( 2c+ j < con^ 



by summing the convergent series. We have shown that for any constant Cq, for all n 
sufficiently large a{n) < c^v? ^ so we finish. □ 

Proof, (of Proposition 13.91) This is now an immediate corollary of Theorem 14. 8[ ^ being the 
number of edges omitted in an empty pair. □ 

Remark 4.9. Theorem \4.S\ and thus Proposition \3.9[ are more natural than might appear. 
On one hand, as Szemeredi regularity deals with density, it cannot (in this formulation) give 
precise information about edge counts below 0{n^). On the other, the random graph contains 
many infinite empty pairs, for instance {{{a, z) : z E M, z ^ a},{{y,a) : y E M,y ^ a}) 
when ip = xRy A -ixRz. One could imagine a future use for such theorems in suggesting 
ways of decomposing the parameter spaces of simple formulas into parts whose structure 
resembles random graphs (with many overlapping empty pairs) and parts whose structure is 
more cohesive, indicated by a{n) < 0{n'^). 



5. Order and genericity 

In this section we show that Pi, after localization, admits arbitrarily large e-regular pairs 
of some fixed density bounded away from and 1 precisely when P2, after localization, is 
unstable. Compare Conclusion [2.71 

Observation 5.1. Suppose that for some < S < 1 and for all e,n with 0<e<l, riGN 
we have a bipartite R-graph {X, Y), \X\ = \Y\ > n, such that (X, Y) is e-regular with density 
d, where \d — 6\ < e. Then R has the order property. 

Proof. It suffices to show that for arbitrarily small eo and arbitrarily large ko there is a 
Szemeredi-regular decomposition of X and of Y into ko pieces such that all but A;o(eo)^ of 
the pairs Xj, Yi are eo-regular with density near 6. This is because we can apply the Key 
Lemma (in light of Remark 14. 3p to conclude that any pattern which appears in the reduced 
graph corresponding to these components, in particular some given fragment of the order 
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property, actually occurs in X, Y. The subtlety is to ensure that the densities of the regular 
pairs are all approximately the same. 

Given eo, k, let ko, Nq be the number of components and threshold size, respectively, given 
by the regularity lemma. Choose e so that ^ > e and n > Nq. Let (X, Y) be the e-regular 
pair of size at least n and density near 6, given by hypothesis. 

By regularity, n > Nq means that there is a decomposition X = Ui^ko^i, Y = ^i<koYi 
into disjoint pieces of near equal size and that all but eo(fco)^ of the pairs (Xj,l^) are eo- 
regular. However any one of these regular pairs (Xj, Yj) will satisfy |Xj|, \Yj\ = n/ko > en, 
so \d{Xi, Yj) - d{X, Y)\ = \d{Xi, Yj) - 5 ± e| < e and |d(Xi, Yj) - 6\ < 2e, as desired. □ 

Recall that an equivalent definition of SOP is that there exists an indiscernible sequence 
{ai : i < uj) on which 3x{-^ip{x] aj) A (p{x] Oj)) <^==^ j < i. The main step in Shelah's classic 
proof that any unstable theory which does not have the independence property must have 
the strict order property can be characterized as follows: 

Theorem E. (Shelah) Let c be a finite set of parameters and {a^ : i < uo) a c-indiscernible 
sequence. Forn<uj, any formula 6 {x;z) and relations R{x;y), Ri, ... Rn where £{y) = i^ai) 
and Ri G {-R(x; y), -^R{x; y)} for i < n, if 

ii < ■ ■■ <i„ =^ 3x (9{x; c) A Ri{x; a^J A ■ ■ ■ A Rnix; UiJ^ 

then either 

• 3x (^9{x; c) A Ri{x; aj^^^j) A ■ ■ ■ A Rn{x] a.V(„)) j for any permutation a : n ^ n 

• some formula of T has the strict order property. 

The idea is to express the permutation cr as a sequence of swaps of successive elements 
(in the sense of the order <), and use the first instance, if any, where the swap produces 
inconsistency to obtain a sequence witnessing strict order. For details, see Theorem 
II.4.7, pps. 70-72. 

The subtlety in the corollary is to obtain not just the independence property but a bipartite 
random graph. 

Corollary 5.2. Suppose that R{x;y) has the order property. If T does not have the strict 
order property, then there exist infinite disjoint sets A, B on which R is a bipartite random 
graph (i.e., R{x;y) is /| in the sense of Definition \ 7. ^ below). 

Proof. We first fix a template. Let M be a countable model of the theory of a bipartite 
random graph with two sorts P, Q and a single binary edge relation E{x; y) with E{x; y) =^ 
P{x) A Q{y). Let {xi : i < u), {yi : i < uj) be an enumeration of P and Q, respectively. 

Now let {ttihi : z < a;) be an indiscernible sequence on which R has the order property, i.e. 
R{ai, hj) <^=^ i < 3- Suppose that for every i < uj we could find an element Cj such that for 
all j < UJ, R{ci, hj) <^=^ E{xi, yj) in the template. Then setting C := IJi<a; ^ ■~ Uj<(^ ^i' 
(C, B) is a bipartite random graph for R. 

So it remains to show that any finite subset p of the type Pi{x) G S{B) of any such 
Ci is consistent. Let ?7, z/ be disjoint finite subsets of u, and let p{x) = R{x;hj) A 
/\k^^^R{x;hk). We are now in a position to apply Theorem \E\ as T is NSOP, p{x) must 
be consistent. □ 
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Definition 5.3. Fix a binary edge relation R. Call a density < S < 1 attainable if for all 

e there exists a sequence {Sf = {{Xi,Yi) : i < uj) of finite bipartite R-graphs such that for all 
n < uj,e > there is N < u such that for all i > N, 

• \Xi\ = \Yi\ > n, 

• is e-regular with density di, where \di — 6\ < e. 

Conclusion 5.4. Assume T does not have the strict order property. Then the following are 
equivalent for a binary relation R{x,y): 

(1) For some < 6 < 1 and for all N,e there exist disjoint X,Y with \X\ = \Y\ > N 
such that {X, Y) is e-regular with density d, \d — 6\ < e. 

(2) For any attainable < 6 < 1 such that for all N, e there exist disjoint X, Y with 
l-^l = 1^1 > ^ such that (X, Y) is e-regular with density d, \d — 6\ < e. 

(3) R has the order property. 

Proof. (2) — > (1) Attainable densities exist, e.g. |: consider subgraphs of an infinite random 
bipartite graph. 

(1) ^ (3) Observation EH 

(3) (1) Corollary 15.21 which says that from (3), assuming NSOP, we can construct an 
infinite random bipartite graph with edge relation R. □ 

In other words, regularity plus compactness implies that density bounded away from 0, 1 
gives any bipartite configuration including the order property, and model theory implies that 
the order property is enough to reverse the argument. 

Corollary 5.5. Assume T does not have the strict order property, and (T,(p) {Pn) ■ Then 
the following are equivalent: 

(1) After localization, P2 does not have the order property. 

(2) After localization, the density of any sufficiently large P2-regular pair {X, Y) must 
approach either or 1. More precisely, there exists / : N x (0, 1) — > [0, |] monotonic 
increasing as n 00, e such that if X,Y C Pi, \X\, \Y\ > n and {X,Y) is 
e-regular, then either d{X, Y) < f{n, e) or d{X, Y) > 1 ~ f{n, e). 

Corollary 5.6. // T is simple, then any characteristic sequence associated to one of its 
formulas satisfies the equivalent conditions of Corollary \5.5[ 

Proof. Conclusion 12. 7[ □ 

Remark 5.7. The class of theories satisfying the equivalent conditions of Corollary 15.51 
strictly contains the simple theories. Example 3.6 o/[6] gives a formula with the tree property 
whose P2 does not have the order property. This is essentially TJ^^ from [9\; basic examples 
ofTP2 will work. 

Remark 5.8. Any formula with SOP2, also called TPi, has the order property in P2. For 
SOP2, see [S]. However, the next section suggests that more precise order properties may be 
useful. 

6. Two KINDS OF ORDER PROPERTY 

Towards understanding the role of instability in the characteristic sequence, this section 
considers two polar opposite order properties and their implications for P2. 
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Definition 6.1. (Two kinds of order property) Let (Pn) be the characteristic sequence of (f. 

(1) has the n-compatible order property, for some n < uj (or n = oo) if there exist 
{ai,bi : i < uj) such that for all m < n (or m < uj), P2m{o,i-^,bj^, . . . ai^,bj^) iff 
max{zi, ...im}< min{ji, . . .j^}. 

(1) ' When the sequence has support 2 this becomes: there exist {ai, bi : i < ui) such that 

P2{au aj), P2{bi, bj) for all i,j and Psla*, bj) iff i < j. 

(2) ip has the n-empty order property, for some n E uj, if: 

there exist {ai,bi : i < uj) such that (i) P2{ai;bj) iff i < j and (ii) -iP„(ajj, . . . Oj^), 
-^Pn{bi^, ■ ■ - bij hold for a/Hi,...i„ < uj. 

Let us briefly justify not focusing on a natural tliird possibility, the "semi-compatible 
order property," in which the elements {ai : i < uj) are an empty graph and the elements 
{bi : i < uj) are a positive base set. 

Observation 6.2. There is a formula in the random graph which has the semi- compatible 
order property. 

Proof. Choose two distinguished elements 0, 1 (this can be coded without parameters). 
Deflne ipi^x; y, z) to he x = y if z = 0, xRy otherwise. Then on any sequence of distinct 
elements {aibi : i < uj) d M which witness the order property [aiRbj <^=^ i < j), we have 
additionally that 

3x (^ip{x; ai,0) A ilj{x;bj,l)) <^=^ 3x [x = ai A xRbj) i<j 

so P2 has the order property on the sequence ((ai,0), {bi, 1) : i < uj). On the other hand, 
3x{x = ai Ax = Qj) 4=^ i = j, so the row of elements (oj, 0) is a P2-empty graph. Finally, 
3x{xRbi A xRbj) always holds, by the axioms of the random graph; so the row of elements 
{bj, 1) is a Poo-complete graph. □ 

Observation 6.3. There is a formula in a simple rank 3 theory which has the 2-empty order 
property. 

Proof. Let T be the theory of two crosscutting equivalence relations, E and F, each with 
inflnitely many infinite classes and such that each intersection {x : E{a,x) A F{x,b)} is 
infinite. Let P be a unary predicate such that 

. {^x,y){E{x,y)AF{x,y) =^ P{x) ^ P{y)) 

• For s\\ n < UJ and yi, ■ ■ - ykyVk+i, ■ ■ - Vn elements of distinct -E-equivalence classes, 
there exists z such that i < k {\fx){E{x,y) A F{x,z) =^ P{^)) ci-nd k < i < 

n =^ {\/x){E{x,y) AF{x,z) =^ -^P{x))) 

Let ip{x; y, z) be E{x, y) A P{y) if z = 0, and F{x, y) A P{y) otherwise. Let (oj, bi : i < uj) 
be a sequence of elements chosen so that (yx){E{x, ai) A F{x, bj) =^ P{x)) iff ^ < j- Then 
it is easy to see ip has the 2-empty order property on the sequence {{ai, 0), {bi, 1) : i < uj). □ 

Remark 6.4. Assuming MA + 2*^'' > Ki, Shelah has constructed an ultrafilter on uj which 
saturates (small) models of the random graph, but not of theories with the tree property {[7\ 
Theorem VI.3.9). This is a strong argument for the "semi- compatible order property" being 
less complex: it cannot, by itself, imply maximality in the Keisler order, whereas we will see 
that the 00-compatible order property does. It may still be that persistence, in the sense of 
[B], of any order property in P2 creates complexity. 
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We return to the study of the compatible order property. 

Convention 6.5. When more than one characteristic sequence is being discussed, write 
Pn{<^) to indicate the nth hypergraph associated to the formula (p. Recall that ip^ is shorthand 
for Ai<i<ey^{x;yi). 

The following general principle will be useful. 

Claim 6.6. Suppose that we have a sequence C := {ci : i E Z) and a formula p(x; y, z) such 
that: 

(1) 3xp{x]Ci,Cj) i<j 

(2) 3x (A£<n Cj,, CjJ) just in case max{zi, . . . z„} < min{ji, . . . j„} 
Then p has the oo-compatible order property. 

Proof. By compactness, it is enough to show that there are elements {ai, (3i : i < n) witness- 
ing a fragment of the oo-compatible order property of size n. 

Define ai . . . an, . . . /5„ as follows. Remark [6171 provides a picture. 

• ai := C2i-iC4„_2i+i, 1 <i <n 

• (3i := C-2iC2,, 1 <i <n 

Then Pi{ai), Pi{(3i) for 1 <i <nhy (1). For all 1 < A;, r < n with r + /c = m, condition 
(2) says that Pm{ai^,. ■ ■ ■ ■ ■ Pjr) iS 

max{2£ : i e {ji, . . . j,.}} < min{2s - 1 : s G {h, . . . ik}} 

that is, iff maxjji, . . . jV} < min{ii, . . . i^}, so we are done. □ 

Remark 6.7. The oo-compatible order property decribes an interaction between two Poo- 
complete graphs, i.e. consistent types. The hypotheses (l)-(2) of Claim lUT^ are enough to 
allow a weak description of intervals. That is, we choose the sequences ai, Pi to each describe 
a concentric sequence of intervals (each ai, (3i corresponds to a set of matching parentheses) 
along the sequence {ci): 

- [-[-[-H-H-] (-(-(-(-)-)-)-) - 



which we can interlace to obtain oo-c.o.p. by judicious choice of indexing: 



In this picture, the enumeration of the as { ), would proceed from the outmost pair to the 
inmost and the enumeration of the /3s [ ] from inmost to outmost. 

Observation 6.8. Suppose that (p has the strict order property, i.e. there is an infinite 
sequence {ci : i < uj) on which 3x{-^p{x] Ci) A p{x] Cj)) <^=^ i < j ■ Then -^p{x; y) A p{x; z) 
has the oo-compatible order property. 

Proof. Writing p(x; z) = -^p{x; y) A p{x] z), 

• 3xp{x] Ci, Cj) <^==^ i < j, by definition of strict order; 

• 3x{p{x;Ci,Cj) A p{x;ck,ce)) <^=^ i,k<j,i 

and the characteristic sequence Poo{p) has support 2. Apply Claim [6l6l □ 

14 



Example 6.9. The theory T of the triangle-free random graph with edge relation R has the 
oo-c.o.p. Consider if{x; y, z) = xRy A xRz. (The negative instances could he added hut are 
not necessary.) Then: 

• Pi{{y,z)) ^ -.yRz. 

• P2i{y,z),{y',z')) iff {y,y',z,z'} is an empty graph. 

• The sequence has support 2, as the only prohlems come from a single new edge: 
PniivuZi), . . . {yn,Zn)) iff 

3x j yAy xRyi A y\ xRzj j that is, if \^yiU [_Jzj is a P2-empty graph. 

\i<n i<fi / ' j 

Let {ai, hi : i < uo) he a sequence witnessing the 2-empty order property with respect to the 
edge relation R, say aiRhj iff j < i. Then 3x{xRai A xRbj) iff i < j, i.e. {ai,bj) G Pi iff 
i < j . Also, 3x{xRai A xRbj A xRa^ A xRhf) if, in addition, i,k < j, i. Apply Claim lUTSi 

Finally, we tie the compatible order property to SOP3, a model-theoretic rigidity property. 
SOP3 will be discussed extensively in the next section, Definition \7.A\ Definition 17.51 



Lemma 6.10. Suppose that 6{x; y) has SOP3 in the sense of Definition 7.4\ so i{x) = i{y). 



Let ifr = ^, ipi = ip he the formulas from Definition 1.5. Then p{x\ y, z) := ^riUi x) Aipe{x, z 



has the 00 -compatible order property on some A' C Pi. Moreover, we can choose A' so that 
the sequence restricted to A' has support 2. 

Remark 6.11. This is an existential assertion, and it is straightforward to check that it 
remains true if we modify p to include the corresponding negative instances. 

Proof, (of Lemma) Let A := (a^ : i < Q) be an infinite indiscernible sequence from Definition 
1731 Then 

Pi{{ai,aj)) 3x {ipr{ai,x) Aipi{x,aj)) i<j 

by the choice of <f,ip. More generally, 

P„((ai,,ajJ,...(ai„,ajJ) 3x i /\(pr{x;ai,) A /\ipeix;ajj 

\t<n t<n 

which, again applying Definition l7.5l happens iff maxjzi, . . . z„} < min{ji, . . . a condition 
which has support 2. We now apply Claim [6^ to obtain A' C Ax A witnessing the compatible 
order property. Note that while (P„) need not depend on 2 elsewhere in Pi (we know very 
little about p oS A), it does depend on 2 on elements from the sequence A'. □ 

Observation 6.12. Suppose 6{x;y) has the 00-compatihle order property. Then the formula 
ip{x; y, z) := 9{x; y) A -^9{x; z) has SOP3. 

Proof. Let (ajfoj : i < u) he a sequence witnessing the 00-compatible order property; this 
will play the role of the sequence {ai : i < uj) from Definition 17.51 In the notation of 
that Definition, let ip{x;y,z) := 9{x;y) A -^9{x;z) and ip{x]y,z) := 9{x]z). We check the 
conditions. 

(1) Clearly {ip{x]y,z),ip{x]y,z)} is inconsistent. 

(3) When i > j, {ip{x; aibi),ip{x; ajbj)} = {9{x;ai) A -^9{x;bi),9{x;bj)} is inconsistent 
because -^P2{ai,bj). 
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Finally, for 1 < j < a; let pj{x) = {9{x;ai) : I < i < j} U {6{x;be) : j < i < to}. 
The oo-c.o.p. implies Pn{cLi, ■ ■ - aj, . . .hn) for all ri < cu, so pj is consistent. However, 

i < i =^ -'P2{bi,aj) so Pj,n(yX) I i9{x;bi) for each I < i < j. Choosing Cj \= pj for each 

j < uj gives (2). □ 

Remark 6.13. Applying Shelah's theorem that any theory with SOP^ is maximal in the 
Keisler order [8], [9], we conclude that if T contains a formula with the oo-compatible 
order property, then T is maximal in the Keisler order. For more on Keisler's order, see [5]. 

7. Calibrating randomness 

In this final section, we observe and explain a discrepancy between the model-theoretic 
notion of an infinite random fc-partite graph and the finitary version given by Szemeredi 
regularity, showing essentially that a class of infinitary A;-partite random graphs which do 
not admit reasonable finite approximations must have the strong order property SOP3 (a 
model-theoretic notion of rigidity. Definition 17.41 below) . 

7.1. A seeming paradox. 

Observation 7.1. Let T be the theory of the triangle-free random graph, with edge relation 
R. Then it is consistent with T that there exist disjoint infinite sets X, y, Z such that each 
pair (X, y), (y, Z), (X, Z) is a bipartite random graph. 

Proof. The construction has countably many stages. At stage 0, let Xq = {a}, Yq = {b}, Zq = 
{c} where a, b, c have no i?-edges between them. At stage i + I, let Xj+i be Xj along with 
2l^!l+l^iLmany new elements: 

(1) for each subset r C Fj, a new element Xr such that for y E Y, XrRy <^==^ V ^ 
however -^XrRx for any x previously added to Xj+i. 

(2) for each subset u C Zi, a. new element x^ such that for z E Z, x^Rz <^=^ ^ G z/, 
with Xv likewise i?-free from previous elements of Xj+i. 

Fj+i, Zi^i are defined symmetrically. As we are working in the triangle-free random graph, 
in order that the the construction be able to continue, it is enough that the sets Xj, Fj, Zi 
are each empty graphs, i.e., at no point do we ask for a triangle. 

To finish, set X = IJ^ Xj, F = |J- Fj, Z = [J- Zi. Each pair is a bipartite random graph, 
as desired. □ 

But recall: 

Theorem F. (weak version of Key Lemma, Theorem\D^ Fix 1 > 5 > and a binary edge 
relation R. Then there exist e' = e'{6), N' = N'{e', 6) such that: if e < e' , N > N' , X, Y, Z 
are disjoint finite sets of size at least N, and each of the pairs (X, Y), {Y, Z), (X, Z) is e- 
regular with density 6, then there exist xEX,yEY, z^Z so that x, y, z is an R-triangle. 

Obviously, we cannot have an i?-triangle in the triangle-free random graph. Nonetheless 
each of the pairs (X, Y) in Observation 17. II manifestly has finite subgraphs of any attainable 
density. 

The difficulty comes when we try to choose finite subgraphs X' G X,Y' G Y, Z' G Z so 
that the densities of all three pairs are simultaneously near the same 6 > 0. If (X', Y') and 
{Y', Z') are reasonably dense, (X', Z') will be near 0. Put otherwise, we may choose elements 
of X independently over Y, and independently over Z, but not both at the same time. 
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The constructions below generalize this example, and give a way of measuring the "depth" 
of independence in a constellation of sets Xi, . . . Xn, where any pair {Xi,Xj) is a bipartite 
random graph. The example of the triangle-free random graph is paradigmatic: we shall see 
that a bound on the depth of independence will produce the 3-strong order property SOP3. 

7.2. Constellations of independence properties. 

Definition 7.2. Fix a formula R{x; y). 

(1) Let A,B be disjoint sets of k- and n-tuples respectively, where k = £{x),n = i{y). 
Then A is independent over B with respect to R just in case for any two finite disjoint 
r],^ G B, there exists a & A such that b & rj ^ R{a; b) and b & u -iR{a; b). 

(2) Let Ai, . . . Ak be disjoint sets (of m-tuples, where m = i{x) = i{y))- Then A\ is 
independent over A2, ■ ■ ■ Ak with respect to R just in case Ai is independent over 
B := [J2<i<k^i sense of (2). 

(3) R{x\ y) is a bipartite random graph if there exist disjoint infinite sets A, B such that 
A and B are each independent over the other wrt R. 

(4) R{x\ y) is J™, for some 2 < k < m, if there exist disjoint infinite sets {Ai : i < m) 
such that for any distinct ii, . . .ik < uj, Ai^ is independent over IJ2<j<fc w.r.t. R. 
Note that k refers to the depth of the independence, and not the size of the finite 
disjoint rj,!/. 

Observation 7.3. Let R{x; y) be a symmetric formula. The following are equivalent. 



(2) There is an infinite subset of the monster model on which R is a random graph. 
(Certainly this need not be definable or interpretable in any way). 

Definition 7.4. (Shelah, [8]:Definition 2.5) For n > 3, the theory T has SOPn if there is 
a formula ip{x] y), i{x) = £{y) = k, M \= T and a sequence {ai : i < uj) with each ai G M*^ 
such that: 

(1) M \= ip{ai, aj) for i < j < uj 



same formula). 

(2) If T is a complete theory with SOP3, then T is maximal in the Keisler order. 

We will derive SOP3 from failures of randomness, using the following equivalent definition. 
Remember that, by convention, a^, x, . . . need not be singletons. 

Definition 7.5. ([IjiFact 1.3) T has SOP3 iff there is an indiscernible sequence {ai : i < uj) 
and C-formulas ip{x; y),ip{x] y) such that: 

(1) {ip{x]y),ip{x]y)} is contradictory. 

(2) there exists a sequence of elements {cj : j < uj) such that 



(1) R ts 




• i> j =^ i^{cj;ai) 
(3) if i < j, then {ip{x; aj),i/j{x] ai)} is contradictory. 
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The idea of the construction (Theorem 17.71) is contained in the following straightforward 
example. 

Example 7.6. Let T be the triangle free random graph, with edge relation R. Then R is /| 
but not /|, and T is SOP3. 

Proof. Let us prove the final clause (for the rest see Observation 17.11 and the discussion 
following) . 

The theory by definition contains a forbidden configuration, a triangle. Suppose A,B,C 
are disjoint infinite sets witnessing /|. Let us construct a sequence of triples S = {ai, hi, Ci : 
i < uj) such that, for i < uj, 

• For all j < i, biRuj. 

• For all j < i, CiRbj. 

• For all j < i, Qi+iRcj. 

Define a binary relation <e on triples by: 



While <e need not be a partial order on the model, it does linearly order the sequence S by 
construction. Looking towards Definition 17.51 let us define two new formulas (the variables 
t stand for triples): 



• i^{to;ti,t2) = to <i ti <i t2 

Let us check that these formulas give SOP^. For condition (1), ^9(^0; ^i, ^2), "^(^o! ^i? ^2) 
means that (xo,?/o, ^o) <e {xi,yi,zi) <i (x2,?/2, 2^2) <i {xo,yo,zo). Then XiRyj, yjRzk, ztRxi 
which gives a triangle, contradiction. 

It is straightforward to satisfy (2) by compactness (e.g. by choosing S codense in a larger 
indiscernible sequence). 

Finally, for condition (3), suppose i < j but ip{t]'~^i),ip{t]'^j) is consistent, where t = 
{x,y,z). This means that {x,y,z) <£ {ai,bi,Ci) <£ {aj,bj,Cj) <£ {x,y,z) (where the middle 
<£ comes from the behavior of <£ on the sequence 5"). As in condition (1), this gives a 
triangle, contradiction. □ 

We can extend this idea to a much larger engine for producing the rigidity of SOP3 from 
a forbidden configuration. 

Theorem 7.7. Suppose that for some 2 < n < to , the formula RofTis but not In+l- 



Proof. The construction is arranged into four stages. 
Step 1: Finding a universally forbidden configuration G. 

By hypothesis, R is not In+i- This means that the infinitary type p(Xo, . . .X„), which 
describes n + 1 infinite sets Xi which are I^Xi sense of Definition 17. 2[ is not consistent. 

Let G be a finite inconsistent subset of height /i, in the variables Vq = {x* : 1 < i < /i, < 
3 < n}, and described by the edge map Eq '■ {{{i, j), {i' , j')) : i,i' < h,j 7^ j' < n} — > 
{0, 1}. As the inconsistency of p is a consequence of T, G will be a universally forbidden 
configuration: 




• <^(to; ti, ^2) = ti <i t2 <i to 



Then T is SOP^. 
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(1) T^^i3xl...x'^)i /\ Rix],4) ^ E{{i,j),{z',f)) = l 

\i,i'<h, j^j'<n 

Note that the configuration remains agnostic on edges between elements in the same column, 
in keeping with the definition of /™. 

In what follows G will appear as a template which we shall try to approximate using I"'*'^. 
Here are the vertices of G arranged as they will be visually referenced (the edges are not 
drawn in): 

Q^/i /y*^ ry*^^ 

X„ 

rftP rf>P IfP 

•^0 -^k -^n 

Xq ... ... X„ 

Figure 1. Vertices of the forbidden configuration G, arranged in columns. 
When comparing this configuration to an array whose rows are indexed modulo 
h, the superscript of the top column becomes 0. 

Step 2: Building an array A of approximations to G. 

Let Af), . . . An he disjoint infinite sets witnessing J^"*"^ for R. As in Example 17.61 we will 
use elements from these columns Ai to build an array A = (af :l<p<uj,0<i<n). 
Fixing notation, 

• Oq, . . . is called a row. 

• Col(i) = {j : j i,i + 1 (modn + 1)} is the set of column indices associated to the 
column index i. 

• Define an ordering on pairs of indices {(3 for "before"): 

(3{{t',l'),{t,l)) ^ def 

it' <tM' e Col(i)) V it' = tM' <i) 



Claim 7.8. We may build the array A to satisfy: 

(1) For all p, a1 E A^. 

(2) For any p', p, k, k' such that P{{p', k'), (p, k)), 

aiRai ^ Ec{{r,k),{r',k')) = l 

where r = p (mod h), r' = p' (mod h). 

Proof. We choose elements in a helix (aj, a\, . . . a^, Cq, a^, . . . ) so that /3((p', k'), (p, k)) implies 
that a^, is chosen before a^. 

When the time comes to choose a^, we look for an element of A^ which satisfies Condition 
(2) of the Claim, that is, which, by Condition (1), realizes a given i?-type over disjoint 
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finite subsets of the columns Ai {i G Col(fc)). As (Aq, . . . A„) was chosen to be J""*"^ and 
I Col/c| = n — 1, an appropriate exists. □ 

Step 3: Defining the relation <£, which has no pseudo-{n + l)-loops. 

We now define a binary relation <£ on m-tuples, where m = h{n + l). Fix the enumeration 
of these tuples to agree with the natural interpretation as blocks Bi of h consecutive rows 
in the array A (see Figure 17^ . That is, write the variables Y := {yj : 1 < t < h, < i < n) , 
Z := {zf, ■.l<t'<h,0<i'<n). Define: 



/\ e Com =^ {4 R y^ ^ EG{{t,t), {t',t')) = l) 

l<t',t<h, 0<i,i'<n 



Let be a partition of the array A into blocks Bk {k < u) each consisting of h consecutive 
rows, so Bk := {al : < t < n, kh + 1 < r < (kh) + h), for each k < u (see Figure [7^ . By 
Claim [721 « S j =^ Bi <^ Bp 

Definition 7.9. A pseudo-(n + l)-loop is a sequence Wj (0 < i < n) such that for some m, 
1 < m < n: 



(2) 



{0<j<i<n) J \l<j<m / \m<j<n / 



Suppose it were consistent with T to have blocks of variables Wq . . . Wn which form a 



pseudo-(n + l)-loop. Write Wk{i) = {wf ... for the ith column of block Wi 



k- 



Figure \T7I\ gives the picture, where the elements a are replaced by variables w and the blocks 
Bi become Wi. Set Wq = W^o(O) U ■ ■ ■ U Wn{n) (which can be visualized as the boldface 
columns in Figure 17^ . 

By definition of <^, the pseudo-(n+l)-loop (j2]) implies that whenever 

(( i e Col(i)) A ((0 < j < 2 < n) V (j = A i < m) V (m < j A i = 0))) 

we will have: 

(V wi e W{t), wi e WU)) (wi R wi ^ Eaiit, k), (t', k')) = { 



In other words, <£ says that on certain pairs of elements in our proposed instance Wq of 
G, namely those elements whose respective columns "fall into each other's scope" as given 
by the Col operator, Wq faithfully follows the template of G. It is easy to check that in a 
pseudo-(n + l)-loop every pair j ^ iin {0, . . .n} has this property. Thus pseudo-(n + l)-loops 
in <£ are inconsistent with T. 

Step 4^ Obtaining SOP3. 

Step 3 showed that our array A of approximations had a certain rigidity, which we can now 
identify as SOP3. Following Definition 17.51 let us define fr{x; yi, ■ ■ ■ y-n) and ^^^(x; yi, ■ ■ ■ yn), 
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Figure 2. Elements of the array A, arranged in blocks of h rows. The bold- 
face refers to Step 4 of the proof, when a proposed witness to G is assembled 
from the ith columns of blocks Bi in a pseudo-(n + l)-loop. 
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where the the variables are blocks, and the subscripts and "r" are visual aids: the element 
X goes to the left of the elements yi under ■?/', and to their right under ip. 
That is, we set: 

• ^r{x]yi, ...yn) = 

/\ Vi <e Vj A /\ yi<iX 

• i^e{x;yi,...yn) = 

/\ X <iyi A /\ yi<i yj 

l<i<n ^l^iy^j^n- 

Now let us verify that the conditions of Definition 17.51 hold. Let B be the sequence of 
blocks defined in Step 3, and assume without loss of generality that B = {Bk : k < uj) 
is indiscernible and moreover is dense and codense in some indiscernible sequence B' . Let 
A = {Ai : i < uj) he an indiscernible sequence of n-tuples of elements of B. 

(1) {y:>r{x; yi, ■ ■ ■ ?/„), yi, ■ ■ ■ ?/„)} is contradictory because it gives rise to a pseudo- 
(n + l)-loop. 

(2) By construction, for any k < uj, the type 

{ipiix; Aj) : j < k}U {(fr{x; Ai) : k < i} 

is consistent, because <£ linearly orders B, thus also B'. Choose the desired sequence 
of witnesses to be elements in the indiscernible sequence B' which are interleaved with 
B. 

(3) Suppose we have {(pr{x; Aj),ilj£{x; Ai)} for some i < j, or in other words: 

{(frix; Bj^, . . . BjJ,ipe{x; Bi^, . . . BiJ} where < {ji,---jn} 

Then x <£ Bi^ <£■■■<£ Bi^ <e Bj^ <£■■■<£ Bj^ <^ x is a pseudo-(2?T, + l)-loop 
(remember that <e holds between any increasing pair of elements of B by construc- 
tion). Thus a fortiori we have a pseudo-(n + l)-loop, contradicting the conclusion of 
Step 3. 

We have shown that the theory T has SOP3, so we finish. □ 
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